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I give an account of my involvement with the chiral anomaly, and with the non- 
renormalization theorem for the chiral anomaly and the all orders calculation 
of the trace anomaly, as well as related work by others. I then briefly discuss 
implications of these results for more recent developments in anomalies in super- 
symmetric theories. 

> 

' 1. Introduction 

A distinguishing feature of the chiral and trace anomalies in gauge theory is that 
their coefficients can be determined to all orders of perturbation theory. My aims 
in this article are to survey the historical route leading to the discovery of this 
property of anomalies, and to review its current status. In the first part I discuss 
the nonrenormalization of the chiral anomaly, in its historical context and then 
from a modern perspective. In the second part I discuss the closely analogous results 
|' linking the coefficient of the trace anomaly to the renormalization group function. 

In the third part, I conclude by discussing the interplay of these two results in the 
context of supersymmetric theories. 
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2. Nonrenormalization of the Chiral Anomaly 
2.1. Setting the Stage 

My 1969 paper on anomalies [H] consisted of two parts. The first was a discussion 
of the axial anomaly in spinor electrodynamics, and represented work done during 
the spring and summer of 1968 and written up in longhand form (awaiting typing 
on my return to Princeton) while I was at the Aspen Center of Physics. The sec- 
ond part consisted of an Appendix and additional footnotes, written after Sidney 
Coleman arrived in Aspen towards the end of my stay there, and told me about the 
independent work done by Bell and Jackiw |19j on the anomaly in the context of 
the partially conserved axial current (PCAC) calculation of ir° — > 2j decay in the 
sigma model. Both parts are relevant to the story of the anomaly nonrenormaliza- 
tion theorem, and so I shall begin by discussing them in some detail. 

I got into the subject of anomalies in an indirect way, through exploration dur- 
ing 1967-1968 of the speculative idea that the muon-electron mass difference could 
be accounted for by giving the muon an additional magnetic monopole electromag- 
netic coupling through an axial- vector current, which somehow was nonperturba- 
tively renormalized to zero. After much fruitless study of the integral equations 
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for the axial- vector vertex part, I decided in the spring of 1968 to first try to an- 
swer a well-defined question, which was whether the axial-vector vertex in QED 
was renormalized by multiplication by Z2, as I had been implicitly assuming. At 
the time when I turned to this question, I had just started a 6-week visit to the 
Cavendish Laboratory in Cambridge, England after flying to London with my family 
on April 21, 1968. a In the Cavendish I shared an office with my former adviser Sam 
Treiman, and was enjoying the opportunity to try a new project not requiring ex- 
tensive computer analysis; I had only a month before finished my Annals of Physics 
paper [2] on weak pion production, which had required extensive computation, not 
easy to do in those days when one had to wait hours or even a day for the results 
of a computer run. 

My interest in the multiplicative renormalization question had been piqued by 
work of van Nieuwenhuizen, in which he had attempted to demonstrate the finitc- 
ness to all orders of radiative corrections to /i-decay, using an argument based on 
subtraction of renormalization constants that I knew to be incorrect beyond leading 
order. I had learned about this work during the previous summer, when I was a lec- 
turer at the Varenna summer school held by Lake Como from July 17-29, 1967, at 
which van Nieuwenhuizen had given a seminar on this topic that was critiqued by 
Bjorken, another lecturer.* 3 Working in the old Cavendish, I rather rapidly found an 
inductive multiplicative renormalizability proof, paralleling the one in Bjorken and 
Drell |22 for finiteness of Z2 times the vector vertex. I prepared a detailed outline 
for a paper describing the proof, but before writing things up, I decided as a check 
to test whether the formal argument for the closed loop part of the Ward identity 
worked in the case of the smallest loop diagram. This is a triangle diagram with one 
axial and two vector vertices (the AVV triangle; see Fig. 1(a)), which because of 
Furry's theorem (C invariance) has no analog in the vector vertex case. I knew from 
a student seminar that I had attended during my graduate study at Princeton that 
this diagram had been explicitly calculated using a gauge-invariant regularization 
by Rosenberg [H2j, who was interested in the astrophysical process 7y + v — ► 7 + v, 
with "fy a virtual photon emitted by a nucleus. I got Rosenberg's paper, tested the 



a Thcsc details of dates were recorded by my ex-wife Judith in my oldest daughter Jessica's "baby 
book" . 

b I wish to thank Peter van Nieuwenhuizen for a phone conversation clarifying this part of the 
history. In my 1998 Dirac lecture ;6; , and several archival historical accounts, I had written "My 
interest in the multiplicative renormalization question had been piqued by a preprint I received 
from Peter van Nieuwenhuizen, in which he attempted to show that the axial vector vertex is made 
finite by the usual renormalizations, using an argument based on subtractive renormalization that 
I saw was incorrect." On reexamining my files in the course of preparing this article, I found that 
my copy of van Nieuwenhuizen's undated preprint entitled "Finiteness of radiative corrections in 
all orders to /i-decay" had been sent, in response to a letter I wrote to Veltman, by Veltman's 
secretary Ms. Rietveld, who wrote a cover letter dated May 28, 1968 expressing the hope that the 
preprint would reach me while I was still in England. Thus I could not have seen the preprint until 
the end of my stay in Cambridge. I do not have a copy of my letter to Veltman inquiring about a 
preprint version of van Nieuwenhuizen's talk, to which Ms. Rietveld's letter was a response. 
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Ward identity, and to my astonishment (and Treiman's when I told him the result) 
found that it failed! I soon found that the problem was that my formal proof used 
a shift of integration variables inside a linearly divergent integral, which (as I again 
recalled from student reading) had been analyzed in an Appendix to the classic text 
of Jauch and Rohrlich with a calculable constant remainder. For all closed loop 
contributions to the axial vertex in Abelian electrodynamics with larger numbers 
of vector vertices (the AVVVV, AVVVVVV loops; see Fig. 1(b)), the fermion 
loop integrals for fixed photon momenta are highly convergent and the shift of inte- 
gration variables needed in the Ward identity is valid, so proceeding in this fermion 
loop-wise fashion there were apparently no further additional or "anomalous" con- 
tributions to the axial-vector Ward identity. With this fact in the back of my mind 
I was convinced from the outset that the anomalous contribution to the axial Ward 
identity would come just from the triangle diagram, with no renormalizations of the 
anomaly coefficient arising from higher order AVV diagrams with virtual photon 
insertions. 
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Fig. 1. Fermion loop diagram contributions to the axial- vector vertex part. Solid lines are 
fermions, and dashed lines are photons, (a) The smallest loop, the AVV triangle diagram, (b) 
Larger loops with four or more vector vertices, which (when summed over vertex orderings) obey 
normal Ward identities. 



In early June, at the end of my 6 weeks in Cambridge, I returned to the US and 
then went to Aspen, where I spent the summer working out a manuscript on the 
properties of the axial anomaly, which became the body (pages 2426-2434) of the 
final published version [3] . Several of the things done there will figure in the further 
discussion of anomaly nonrenormalization. The first was a calculation of the field 
theoretic form of the anomaly, giving the now well-known result 

d»j*(x) = 2im j 5 (x) + ^ F ^{x)F^{x)e iaTp , (1) 

with j{j = ^l^lbi 3 the axial- vector current (referred to above as A), j 5 = V'75'0 the 
pseudoscalar current, and with mo and «o the (unrenormalizcd) fermion mass and 
coupling constant. The second was a demonstration that because of the anomaly, 
is no longer the multiplicative renormalization constant for the axial- vector vertex, 
as a result of the diagram drawn in Fig. 1(a) in which the AVV triangle is joined to 
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an electron line with two virtual photons. Instead, the axial-vector vertex is made 
finite by multiplication by the renormalization constant 

Z A = Z 2 [1 + ^(a A) 2 log(A 2 /m 2 ) + ...] , (2) 

thus giving an answer to the question with which I started my investigation. Thirdly, 
as an application of this result, I showed that the anomaly leads, in fourth order of 
perturbation theory, to infinite radiative corrections to the current-current theory 
of v^ii and v e e scattering, but that this infinity can be cancelled between different 
fermion species by adding appropriate v^e and v e fi scattering terms to the La- 
grangian. This result is related to the fact, also discussed in my paper, that the 
asymptotic behavior of the AVV triangle diagram saturates the bound given by 
the Weinberg power counting rules, rather than being one power better as is the 
case for the AVVVV and higher loop diagrams. Finally, I also showed that a gauge 
invariant chiral generator still exists in the presence of the anomaly. Although not 
figuring in our subsequent discussion here, in its non-Abelian generalization this 
was relevant (as reviewed in Coleman ,30 ) to later discussions of the U(l) problem 
in quantum chromodynamics (QCD), leading up to the solution given by 't Hooft 

No sooner was this part of my paper completed than Sidney Coleman arrived in 
Aspen from Europe, and told me that Bell and Jackiw (published as Bell and Jackiw, 
had independently discovered the anomalous behavior of the AVV triangle 
graph, in the context of a sigma model investigation of the Veltman |99|-Sutherland 
[HSj theorem stating that 7r° — > 77 decay is forbidden in a PCAC calculation. The 
Sutherland- Veltman theorem is a kinematic statement about the AVV three-point 
function, which asserts that if the momenta associated with the currents A, V, V are 
respectively q, k\ , ki , then the requirement of gauge invariance on the vector currents 
forces the AVV vertex to be of order qk\k 2 in the external momenta. Hence when 
one applies a divergence to the axial-vector vertex and uses the standard PCAC 
relation (with the quark current .FjL the analog of ) 

d^(x) = (f n /V2)Mx) , (3) 

with (fin the pion field and the charged pion decay constant, one finds that the 
7T° — ► 77 matrix element is of order q 2 kik 2 , and hence vanishes in the soft pion 
limit q 2 — > 0. Bell and Jackiw analyzed this result by a perturbative calculation in 
the er-model, in which PCAC is formally built in from the outset, and found a non- 
vanishing result for the tt° — > 77 amplitude, which they traced back to the fact that 
the regularized AVV triangle diagram cannot be defined to satisfy the requirements 
of both PCAC and gauge invariance. This constituted the "PCAC Puzzle" referred 
to in the title of their paper. They then proposed to modify the original cr-modcl 
by adding further regulator fields with mass-dependent coupling constants in such 
a manner as to simultaneously enforce gauge invariance and PCAC, thus enforcing 
the Sutherland- Veltman prediction of a vanishing ir° — ► 77 decay amplitude. In the 
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words of Bell and Jackiw in their paper, "It has to be insisted that the introduction 
of this mass dependence of coupling constants is not an arbitrary step in the PCAC 
context. If a regularization is introduced to define the theory, it must respect any 
formal properties which are to be appealed to." And again in concluding their paper, 
they stated "To the complaint that we have changed the theory, we answer that 
only the revised version embodies simultaneously the ideas of PCAC and gauge 
invariance." 

It was immediately clear to me, in the course of the conversation with Sidney 
Coleman, that introducing additional regulators to eliminate the anomaly would 
entail renormalizability problems in a meson scattering, and was not the correct 
way to proceed. However, it was also clear that Bell and Jackiw had made an 
important observation in tying the anomaly to the Sutherland-Veltman theorem 
for 7T° — * 77 decay, and that I could use the sigma-model version of Eq. to get 
a nonzero prediction for the ir° — > 77 amplitude, with the whole decay amplitude 
arising from the anomaly term. I then wrote an Appendix to my paper (pages 2434- 
2438), clearly delineated from the manuscript that I had finished before Sidney's 
arrival, in which I gave a detailed rebuttal of the regulator construction, by showing 
that the anomaly could not be eliminated without spoiling either gauge-invariancc 
or renormalizability. (in later discussions I added unitarity to this list, to exclude 
the possibility of canceling the anomaly by adding a term to the axial current with 
a d fl /(d\) 2 singularity.) In this Appendix I also used an anomaly modified PCAC 
equation 

d^Ux) = (f w /V2)Mx) + S^-F^(x)F^(x)e ir7Tp , (4) 

with S a constant determined by the constituent fermion charges and axial-vector 
couplings, to obtain a PCAC formula for the ir° — > 77 amplitude F (with [i v the 
pion mass) 

F = -(a/7r)2SV2 fJ 2 JU ■ (5) 

Although the axial anomaly, in the context of breakdown of the "pseudoscalar- 
pseudovector equivalence theorem" , had in fact been observed much earlier, starting 
with Fukuda and Miyamoto jlH] and Steinberger and continuing to Schwinger 
|83j . my paper broke new ground by treating the anomaly neither as a baffling 
calculational result, nor as a field theoretic artifact to be eliminated by a suitable 
regularization scheme, but instead as a real physical effect (breaking of classical 
symmetries by the quantization process) with observable physical consequences. 

This point of view was not immediately embraced by everyone else. After com- 
pleting my Appendix I sent Bell and Jackiw copies of my longhand manuscript, and 
an interesting correspondence ensued. In a letter dated August 25, 1968, Jackiw was 
skeptical whether one could extract concrete physical predictions from the anomaly, 
and whether one could augment the divergence of the axial-vector current by a 
definite extra electromagnetic contribution, as in the modified PCAC equation of 
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Eq. 0. Bell, who was traveling, wrote me on Sept. 2, 1968, and was more appre- 
ciative of the possibility of using a modified PCAC to get a formula for the neutral 
pion decay amplitude, writing "The general idea of adding some quadratic electro- 
magnetic terms to PCAC has been in our minds since Sutherland's rj problem. We 
did not see what to do with it." He also defended the approach he and Jackiw had 
taken, writing "The reader may be left with the impression that your development 
is contradictory to ours, rather than complementary. Our first observation is that 
the a model interpreted in a conventional way just does not have PCAC. This is 
already a resolution of the puzzle, and the one which you develop in a very nice 
way. We, interested in the cr-model only as exemplifying PCAC, choose to modify 
the conventional procedures, in order to exhibit a model in which general PCAC 
reasoning could be illustrated in explicit calculation." In recognition of this letter 
from John Bell, whom I revered, I added a footnote 15 to my manuscript saying 
"Our results do not contradict those of Bell and Jackiw, but rather complement 
them. The main point of Bell and Jackiw is that the a model interpreted in the 
conventional way, does not satisfy the requirements of PCAC. Bell and Jackiw mod- 
ify the a model in such a way as to restore PCAC. We, on the other hand, stay 
within the conventional er model, and try to systematize and exploit the PCAC 
breakdown." This footnote, which contradicts statements made in the text of my 
paper, has puzzled a number of people; in retrospect, rather than writing it as a 
paraphrase of Bell's words, I should have quoted directly from Bell's letter. 

Following this correspondence, my paper was typed on my return to Princeton 
and was received by Physical Review on Sept. 24, 1968. (Bell and Jackiw's paper 
|19j. a CERN preprint dated July 16, 1968, was submitted to II Nuovo Cimento, 
and received by that journal on Sept. 11, 1968). My paper was accepted along with 
a signed referee's report from Bjorken, stating "This paper opens a topic similar 
to the old controversies on photon mass and nature of vacuum polarization. The 
lesson there, as I (no doubt foolishly) predict will happen here, is that infinities 
in diagrams are really troublesome, and that if the cutoff which is used violates a 
cherished symmetry of the theory, the results do not respect the symmetry. I will 
also predict a long chain of papers devoted to the question the author has raised, 
culminating in a clever renormalizablc cutoff which respects chiral symmetry and 
which, therefore, removes Adler's extra term." Thus, acceptance of the point of 
view that I had advocated was not immediate, but only followed over time. In 
1999, Bjorken was a speaker at my 60th birthday conference at the Institute for 
Advanced Study, and amused the audience by reading from his report, and then 
very graciously gave me his file copy, with an appreciative inscription, as a souvenir. 

The viewpoint that the anomaly determined the 7r° — > 77 decay amplitude had 
significant physical consequences. In the Appendix to my paper, I showed that the 
value of the parameter S implied by the fractionally charged quark model gave a 
decay amplitude that was roughly a factor of 3 too small, whereas assuming inte- 
grally charged quarks gave an amplitude that agreed, within the expected accuracy 
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of PCAC, with experiment. In a conference talk a year later, in September 1969 0] 
I reviewed the situation, and noted that the integrally charged triplet model of Han 
and Nambu [S3] (see also Tavkhelidze jSJ ) also agreed with the experimental neu- 
tral pion decay amplitude. These were the first indications that neutral pion decay 
provides empirical evidence for what we now call the "color" degree of freedom of 
the strong interactions. For recent Archive reprintings of some of the seminal papers 
on color, see Bardeen, Fritzsch, and Gell-Mann (hep-ph/0211388) and Fritzsch and 
Gell-Mann (hep-ph/0301127). Standard references for anomaly physics as of 1970 
are my Brandeis lectures |Hj and Jackiw's Brookhaven lectures |55| . 

Before leaving the subject of the early history of the anomaly and its antecedents, 
perhaps this is the appropriate place to mention the paper of Johnson and Low |59j , 
which showed that the Bjorken [23 -Johnson-Low [SH] (BJL) method of identifying 
formal commutators with an infinite energy limit of Feynman diagrams gives, in 
significant cases, results that differ from the naive field-theoretic evaluation of these 
commutators. This method was later used by Jackiw and Johnson |5(:i| and by 
Boulware and myself (Adler and Boulware 9 ) to show that the AW axial anomaly 
can be reinterpreted in terms of anomalous commutators. This line of investigation, 
however, did not readily lend itself to a determination of anomaly effects beyond 
leading order. For example, I still have in my files an unpublished manuscript (circa 
1966) attempting to use the BJL method to tackle a simpler problem, that of proving 
that the Schwinger term in quantum electrodynamics (QED) is a c-number to all 
orders of perturbation theory. I believe that this result is true (and it may well have 
been proved by now using operator product expansion methods) , but I was not able 
at that time to achieve sufficient control of the BJL limits of high order diagrams 
with general external legs to give a proof. 



2.2. Anomaly Nonrenormalization 

We are now ready to turn to our main story, the determination of anomalies beyond 
leading order in perturbation theory. Before the neutral pion low energy theorem 
could be used as evidence for the charge structure of quarks, one needed to be 
sure that there were no perturbative corrections to the anomaly and the low energy 
theorem following from it. As I noted above, the fermion loop-wise argument that 
I used in my original treatment left me convinced that only the lowest order AW 
diagram would contribute to the anomaly, but this was not a proof. This point 
of view was challenged in the article by Jackiw and Johnson |56j , received by the 
Physical Review on Nov. 25, 1968, who stated "Adler has given an argument to 
the end that there exist no higher-order effects. He introduced a cutoff, calculated 
the divergence, and then let the cutoff go to infinity. This is seen in the present 
context to be equivalent to the second method above. However, we believe that this 
method may not be reliable because of the dependence on the order of limits." And 
in their conclusion, they stated "In a definite model the nature of the modification 
(to the axial-vector current divergence equation) can be determined, but in general 
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only to lowest order in interactions" . This controversy with Jackiw and Johnson 
was the motivation for a more thorough analysis of the nonrenormalization issue 
undertaken by Bill Bardeen and myself in the fall and winter of 1968-1969 (Adler 
and Bardeen [H] ) and was cited in the "Acknowledgments" section of our paper, 
where we thanked "R. Jackiw and K. Johnson for a stimulating controversy which 
led to the writing of this paper." 

The paper with Bardeen approached the problem of nonrenormalization by two 
different methods. We first gave a general constructive argument for nonrenormal- 
ization of the anomaly to all orders, in both quantum electrodynamics and in the 
cr-model in which PCAC is canonically realized, and we then backed this argu- 
ment up with an explicit calculation of the leading order radiative corrections to 
the anomaly, showing that they cancelled among the various contributing Feynman 
diagrams. The strategy of the general argument was to note that since Eqs. Q and 
Q involve unrenormalized fields, masses, and coupling constants, these equations 
are well defined only in a cutoff field theory. Thus, for both electrodynamics and 
the er-model, we constructed cutoff versions by introducing photon or c-meson reg- 
ulator fields with mass A. (This was simple for the case of electrodynamics, but 
more difficult, relying heavily on Bill Bardeen's prior experience with meson field 
theories, in the case of the cr-model.) In both cases, the cutoff prescription allows 
the usual renormalization program to be carried out, expressing the unrenormalized 
quantities in terms of renormalized ones and the cutoff A. In the cutoff theories, 
the fermion loop-wise argument I used in my original anomaly paper is still valid, 
because regulating boson propagators does not alter the chiral symmetry properties 
of the theory, and thus it is straightforward to prove the validity of Eqs. Q and (0J 
for the unrenormalized quantities to all orders of perturbation theory. 

Taking the vacuum to two 7 matrix element of the anomaly equations, and ap- 
plying the Sutherland- Veltman theorem, which asserts the vanishing of the matrix 
element of at the special kinematic point q 2 — 0, Bardeen and I then got exact 
low energy theorems for the matrix elements (27|2imoj 5 |0) (in electrodynamics) 
and (27|(/ 7r /v / 2)07r|O) (in the cr-model) of the "naive" axial- vector divergence at 
this kinematic point, which were given by the negative of the corresponding ma- 
trix element of the anomaly term. However, since we could prove that these matrix 
elements are finite in the limit as the cutoff A approaches infinity, this in turn 
gave exact low energy theorems for the renormalized, physical matrix elements in 
both cases. One subtlety that entered into the all orders calculation was the role 
of photon rescattering diagrams connected to the anomaly term, but using gauge 
invariance arguments analogous to those involved in the Sutherland- Veltman the- 
orem, we were able to show that these diagrams made a vanishing contribution to 
the low energy theorem at the special kinematic point q 2 = 0. Thus, my paper with 
Bardeen provided a rigorous underpinning for the use of the 7r° — > 77 low energy 
theorem to study the charge structure of quarks. 

In our explicit second order calculation, we calculated the leading order radia- 
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tive corrections to this low energy theorem, arising from addition of a single virtual 
photon or virtual cr-meson to the lowest order diagram. We did this by two meth- 
ods, one involving a direct calculation of the integrals, and the other (devised by 
Bill Bardeen) using a clever integration by parts argument to bypass the direct 
calculation. Both methods gave the same answer: the sum of all the radiative cor- 
rections is zero, as expected from our general nonrenormalization argument. We 
also traced the contradictory results obtained in the paper of Jackiw and Johnson 
to the fact that these authors had studied an axial-vector current (such as ^7^75^ 
in the a-model) that is not made finite by the usual renormalizations in the ab- 
sence of electromagnetism; as a consequence, the naive divergence of this current 
is not multiplicatively renormalizable. As we noted in our paper, "In other words, 
the axial- vector current considered by Jackiw and Johnson and its naive divergence 
are not well-defined objects in the usual renormalizcd perturbation theory; hence 
the ambiguous results which these authors have obtained are not too surprising." 
Our result of a definite, unrenormalized low energy theorem, we noted, came about 
because "In each model we have studied a particular axial- vector current: in spinor 
electrodynamics, the usual axial- vector current ... and in the a model the Polk- 
inghornc 80, 8fJ axial- vector current ... which, in the absence of electromagnetism, 
obeys the PCAC condition." It is these axial-vector currents that obey a simple 
anomaly equation to all orders in perturbation theory, and which give an exact, 
physically relevant low energy theorem for the naive axial-vector divergence. 

This paper with Bill Bardeen should have ended the controversy over whether 
the anomaly was renormalized, but it didn't. Johnson pointed out in an unpublished 
report that since the anomaly is mass-independent, it should be possible to calculate 
it in massless electrodynamics, for which the naive divergence 2imoj vanishes and 
the divergence of the axial-vector current directly gives the anomaly. Moreover, in 
massless electrodynamics there is no need for mass renormalization, and so if one 
chooses Landau gauge for the virtual photon propagator, the second order radiative 
correction calculation becomes entirely ultraviolet finite, with no renormalization 
counter terms needed. Such a second order calculation was reported by Sen |86| . a 
Johnson student, who claimed to find nonvanishing second order radiative correc- 
tions to the anomaly. However, the calculational scheme proposed by Johnson and 
used by Sen has the problem that, while ultraviolet finite, there arc severe infrared 
divergences, which if not handled carefully can lead to spurious results. After a long 
and arduous calculation (Adler, Brown, Wong, and Young JOj) m y collaborators 
and I were able to show that the zero mass calculation, when properly done, also 
gives a vanishing second order radiative correction to the anomaly. This confirmed 
the result I had found with Bardeen, which had by then also been confirmed by 
different methods in the mo ^ theory in papers of Abers, Dicus, and Teplitz pQ 
and Young, Wong, Gounaris, and Brown |1U7| . 

A second challenge to the nonrenormalization calculation that Bardeen and I 
had done came from DeRaad, Milton, and Tsai , a group associated with Julian 
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Schwinger at UCLA (preprint received by Physical Review March 27, 1972; revised 
version received by Physical Review May 1, 1972, and a second paper (Milton, Tsai, 
and DeRaad [72]) with further calculational details written shortly afterwards). 
They calculated the radiative corrected low energy theorem using a source-theoretic 
method, and in their preprint, they claimed a renormalization factor of 1 + a/(2n), 
in disagreement with what Bardeen and I, and other groups, had found. I learned 
of their work from Bing-Lin Young (one of my collaborators in the uiq = cal- 
culation), who wrote to me on March 29, 1972, asking me to look at the UCLA 
group's preprint, a copy of which he enclosed. I did a short calculation, and then in 
conversations with the UCLA group pointed out that they had imposed an ad hoc 
normalization condition on the pseudoscalar vertex T§ associated with the naive di- 
vergence 2imoj 5 . However, the anomalous Ward identity for the axial- vector vertex 
part uniquely specifies the normalization to be used, since at q = in momen- 
tum space both the <9 M j^ terms and the anomaly term vanish, and one is left with 
= 2moT^(jp,p) + S' F (p) + ^S' F (p)~ 1 . This equation relates the normalization 
of T§(p,p) to that of the electron propagator S' F (p), and must be used as the nor- 
malization of the pseudoscalar vertex in computing radiative corrections to the low 
energy theorem (see equations (26) through (29) of my paper with Bardeen [H]), 
even though it gives an infrared singular value for the on-shell vertex part T§ (p, p) . 
DeRaad, Milton, and Tsai in the revised, published version of their paper |38|. 
which acknowledged conversations with me, recalculated their results for a general 
Y§ normalization point and found a radiative correction 1 + (1 + 8)a/(2ir), with 8 
a parameter related to the normalization point. They showed that the normaliza- 
tion that Bardeen and I had used corresponded to 8 = — 1, that is, no radiative 
correction, in agreement with our calculational result, whereas their original choice 
of normalization at zero momentum transfer squared corresponded to 8 = 0. They 
then argued that since the normalization that Bardeen and I had used corresponds 
to normalization at an infrared sensitive four momentum transfer 3m 2 / log(/z 2 /m 2 ), 
with fx the fictitious photon mass, their normalization choice of zero four momentum 
transfer squared is more natural. However, this argument is spurious because the in- 
frared sensitivity on which it is based is an artifact of the calculational method that 
they used. In fact, the vacuum to two photon matrix element of the naive divergence 
is completely infrared finite, since any sensitive dependence on the fictitious photon 
mass used to calculate vertex parts and electron propagators cancels around the 
closed electron loop. Hence there is no justification for using a normalization con- 
vention differing from the one explicitly specified by the axial-vector Ward identity. 
Of course, if one insists on using a different normalization from that determined 
by the Ward identity, a non-vanishing radiative correction to the anomaly will re- 
sult. As far as I can tell, Schwinger never accepted the nonrenormalization of the 
anomaly (see the Schwinger biography by Mehra and Milton [JT] , footnote on page 
488, and pp. 298-310 of Schwinger jHS] )• 

Almost two decades later, in 1989, another challenge to the Adler-Bardeen the- 
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Fig. 2. Photon rescattering contribution to the basic AW triangle, shown in (a), is identical to 
the triangle contribution to the axial- vector vertex part from Fig. 1(a) used in place of the lowest 
order axial- vector vertex in a basic AVV triangle, as shown in (b). 



orem appeared, this time in a paper of Ansel'm and Iogansen (Johansen) [15] , again 
in the context of massless electrodynamics. These authors showed that when the 
AVV triangle is coupled to a light-by-light scattering diagram, one finds that the 
matrix element of <9 M j^ in a background electromagnetic field is divergent, 

(^) = <2^F^U(l-^lQg^) , (6) 

with k a typical external momentum. This result is just what one would expect from 
using the axial-vector vertex part constructed from the elementary AVV triangle 
as the axial vertex in an AVV triangle, as shown in Fig. 2, which corresponds to 
the calculation done by Ansel'm and Iogensen. And in fact, as noted by them, the 
divergence appearing in Eq. © corresponds to the anomaly-induced divergence in 
the axial- vector vertex part, which is made finite by the renormalization factor of 
Eq. © that I gave in my 1969 paper 0, in other words, 

[f + J( ao / 7 r) 2 log(A 2 /m 2 ) + ..r 1 = l-^_logA 2 /m 2 . (7) 

Moreover, Ansel'm and Iogensen stated in their paper that the operator coeffi- 
cient of the anomaly term has no renormalizations. However, in asserting that the 
Adler-Bardeen theorem is incorrect, and in particular that the ir° — ► 77 low energy 
theorem needs radiative corrections, they made the mistake of confusing matrix 
elements of the axial-vector divergence d^j^ with matrix elements of the naive di- 
vergence 2imoj 5 . In terms of matrix elements, the Adler-Bardeen theorem asserts 
only that the vacuum to two-photon matrix element of the naive divergence has a 
known value, with no renormalizations to all orders, at a particular kinematic point 
at which external momenta are small compared to the fermion mass. (This kinematic 
point is needed, we recall, to be able to invoke the Sutherland- Veltman theorem to 
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assert vanishing of the axial- vector divergence or d^j^ side of the anomaly equation, 
and to assert the vanishing of photon rescattering contributions from the anomaly, 
so that the naive divergence matrix element is given by the lowest order matrix 
element of minus the anomaly.) When the naive divergence is the pion field, as in 
the anomaly-modified PCAC equation, the resulting low energy theorem has useful 
physical consequences, as discussed above. In massless electrodynamics, the naive 
divergence operator vanishes, and the kinematic point at which the low energy the- 
orem holds cannot be attained (q 2 /m 2 is infinite no matter how small q 2 is), so 
there is no obvious analog of the ir° — > 77 low energy theorem, which is probably 
why Ansel'm and Iogensen instead discuss matrix elements of d^j^, which in the 
massless case is given by just the anomaly term. However, Bardeen and I never 
claimed that generic matrix elements of the axial-vector divergence, or the anomaly 
term, have no radiative corrections; such statements would obviously be false, as 
already shown in my 1969 anomaly paper f|. Thus, the main calculational results of 
the Anselm- Iogensen paper are correct, but they make interpretational statements 
that are incorrect, by misunderstanding what the Adler-Bardeen theorem says. I 
gave an analysis similar to this in a letter dated July 27, 1989 to Ansel'm, and again 
in a letter dated October 13, 1997 to Dan Freedman who passed it on to Johansen 
(who was visiting MIT at the time), but unfortunately I never received a response 
from either Ansel'm or from Johansen. 

Long before these events, however, the analysis of anomaly renormalization by 
calculation of higher order Feynman diagrams had been rendered obsolete by new, 
non-perturbative methods. Zee 108 and Lowenstein and Schroer gave direct 
proofs of chiral anomaly non-renormalization in QED starting from the then recently 
discovered Callan-Symanzik equations, and an alternative argument was given in 
the book of Collins [32] . (For extensions to the cr-model, see also Becchi ^B] and 
Shei and Zee [HZj-) Before describing Zee's argument, let me make some technical 
comments about the three proofs. In the paper of Zee the operator dA is to be 
understood as the "naive" divergence of the axial- vector current; in the paper of 
Schroer and Lowenstein this identification is made explicit. (When Zee's argument 
is applied to prove nonrenormalization of a flavor non-singlet current to all orders in 
strong interactions, this clarification is not needed, since the only anomaly term in 
the axial-vector divergence then is electromagnetic, and can be neglected in a strong 
interaction calculation. However, when Zee's argument is applied to prove anomaly 
nonrenormalization in QED, the specification that dA is just the naive divergence 
is needed.) The papers of Zee and of Schroer and Lowenstein both work at the point 
q 2 = where the naive divergence is related to the anomaly coefficient. Collins, by 
contrast, gives an argument based on renormalization properties of the axial- vector 
vertex holding at a general value of q. c In his presentation, Collins specializes to 
flavor non-singlet axial-vector currents in QCD, such as the flavor SU(3) octet 



C I wish to thank Tony Zee and John Collins for phone conversations and emails on these issues. 
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current .Ff of Eq. J3J, which permits him to ignore complications arising from the 
gluon rescattering contributions considered by by Ansel'm and Iogensen. However, 
the Collins version of Zee's proof can be extended to singlet currents, such as the 
axial- vector current in QED or the flavor singlet U(l) axial- vector current in QCD, 
by replacing the general axial-vector vertex in his argument by the two photon 
irreducible axial- vector vertex (or its analog in QCD). (The general AVV three- 
point function, without a two-photon irreducible restriction, is obtained from this 
simpler three-point function by summing over the possibilities of zero, one, two, 
... photon-photon rescatterings as shown in Fig. 3.) The anomaly associated with 
the two-photon irreducible AVV three-point function directly corresponds to the 
operator anomaly, since diagrams that iterate the two-photon irreducible three-point 
function with photon rescatterings, as in Fig. 3, are in one-to-one correspondence 
with diagrams that iterate the operator anomaly with photon rescatterings. 




Fig. 3. The full AVV three-point function is obtained from the two-photon irreducible AVV 
three-point function (shaded triangle) by adding the contributions of one or more photon rescat- 
terings. 



Briefly, Zee's argument is based on consideration of the three-point function 
Tnoj V^Vv formed from the naive axial current divergence and from vector cur- 
rents carrying four-momenta k±, k 2 , which he denotes in momentum space by 
RD^v{ki,k 2 )- From a combination of the Callan-Symanzik equation for this three- 
point function, and the Ward identity for the four-point function obtained by in- 
serting a scalar current in this three-point function, Zee establishes the formula 

/ d d\ 

lm— + af3(a)-\R D ^{k u k 2 )(xO((k 1 + k 2 )k 1 k 2 ) . (8) 
Hence expanding in powers of k\ and k 2 , by writing RD^v{k\, k 2 ) — j e M „ pak^k^ + . . . , 
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the coefficient / of the leading term obeys 



TO _ + a/3(a)- ) / = . (9) 
am a J 

Since the dimensionless coefficient / must be independent of m, this implies that 
df I da = 0, and so / has no dependence on a! As Zee notes in his paper, this argu- 
ment extends readily to theories in which there is more than one nonvanishing mass, 
since there are then additional renormalization group equations to give constraints. 
Thus, the renormalization group argument shows that the low energy theorem for 
the naive divergence, and correspondingly the coefficient of the operator anomaly, 
is given by the lowest order triangle contribution, and extends the Adler-Bardeen 
theorem to rule out even non-perturbative renormalizations of the anomaly. 

Recent textbooks often present the anomaly through the elegant path integral 
formulation given by Fujikawa ^2E2j- In this formulation, as applied to QED, one 
starts from a Feynman path integral for the partition function 

d[B^]d^]e- s ^ B ^^ , (10) 

with the electromagnetic potential four-vector and with tp the fermion field. 
One then makes a chiral transformation on the fermion fields, for which the axial- 
vector current is the Noether current, and examines the behavior of the various 
terms in the path integral. Since all variables in the functional integral are classical, 
the change in the mass term in the action S yields just the naive divergence of 
the axial-vector current. Fujikawa's crucial observation is that this is not the end 
of the story, because one also has to calculate the Jacobian associated with the 
chiral transformation of the functional integration measure. Since this measure is 
an infinite product, a regularization is needed to define it, and as a result of the 
regularization, the measure is not a chiral invariant. Within a suitable family of 
regularizations, Fujikawa shows that the Jacobian associated with the chiral trans- 
formation just produces an extra phase factor corresponding to the axial-vector 
anomaly! One can now ask the question, does Fujikawa's method provide a sim- 
ple and direct non-renormalization proof, or is the fact that his calculation gives 
the exact answer a consequence of the fact that the lowest order anomaly is ex- 
act? This question has been examined by Shizuya and Tsukahara [HJi Shizuya 
and Joglekar and Misra [HHj, who show that Fujikawa's result is a one-loop result, 
and that additional input from perturbation theory and/or details of the regulator 
structure is needed to show that the higher order corrections to the anomaly, arising 
formally in the path integral method, all vanish. This conclusion is reinforced by 
the example of the trace anomaly, to be discussed in Section [21 where there are 
higher order (but known) perturbative corrections to the anomaly, with the path 
integral Jacobian argument giving only the leading order term. Brief discussions of 
the relation between the path integral method and the calculation of anomalies to 
all orders are also found in a Festschrift article by Fujikawa 01], and in a recent 



paper of Arkani-Hamed and Murayama Jl] (see also hep-th/9707133) 
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To conclude this section on the quantum electrodynamics or Abelian anomaly, 
we mention a recent interesting paper by Vainshtein |97| . and its extension and 
application to the muon g — 2 calculation by Czarnecki, Marciano, and Vainshtein 
IHZj (see also hep-ph/0310276), and by Knecht, Perrottet, de Rafael, and Peris H"3] 
(for relevant earlier work by these authors, see [U3]). Vainshtein considers the two- 
photon irreducible triangle A^VaVr with one photon soft, and defines an amplitude 
T^ a obtained from this triangle by contracting it with the soft photon polarization 
e T . This two-index tensor amplitude can be divided into longitudinal and transverse 
parts. In the chiral (zero fermion mass) limit, there is no naive divergence and so 
the longitudinal part is given by the anomaly, and is known to all orders. Vainshtein 
points out that in the chiral limit, there is a crossing symmetry relating the vector 
and axial- vector indices \x and <r, since the 75 in the axial vertex with index fi can be 
freely anticommuted through fermion propagators and vector vertices until it is part 
of the external vector vertex with index a. Applying this crossing symmetry to the 
absorptive part of T^ a , together with a dispersion relation argument, implies that 
the transverse part wt of T pcr is equal to one half of the longitudinal part wl, and 
thus also has no perturbative radiative corrections in higher orders. As discussed 
in the second paper cited above, this result has applications to the calculation of 
hadronic corrections to the muon g — 2, in which a triangle diagram is joined to the 
muon line by a virtual photon and a virtual Z intermediate boson. 



2.3. Point Splitting Calculations of the Anomaly. Did Schwinger 
Discover the Anomaly? 

At this point let me backtrack, and discuss the role of point-splitting methods in 
the study of the Abelian electrodynamics anomaly. In the present context, point- 
splitting was first used in the discussion given by Schwinger {S3] of the pseudoscalar- 
pseudovector equivalence theorem, to be described in more detail shortly. Almost 
immediately following circulation of the seminal anomaly preprints in the fall of 
1968, Hagen [J""""|( received Sept. 24, 1968, and a letter to me dated Oct. 16, 1968), 
Zumino 1 1 1 ( ) j (and a letter to me dated Oct. 7, 1968), and Brandt [2*4] ( received 
Dec. 17, 1968, and a letter to me dated Oct. 16, 1968) all rederived the anomaly 
formula by a point-splitting method. Independently, a point-splitting derivation of 
the anomaly was given by Jackiw and Johnson |56| (received 25 November, 1968), 
who explicitly made the connection to Schwinger's earlier work (Johnson was a 
Schwinger student, and was well acquainted with Schwinger's body of work). The 
point of all of these calculations is that the anomaly can be derived by formal 
algebraic use of the equations of motion, provided one redefines the singular product 
■0(x)7 AI 75^(2;) appearing in the axial-vector current by the point-split expression 




and takes the limit x' — * x at the end of the calculation. 
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Responding to these developments, I appended a "Note added in proof" to 
my anomaly paper, mentioning the four field-theoretic, point-splitting derivations 
that had subsequently been given of Eq. Q, and adding "Jackiw and Johnson 
point out that the essential features of the field-theoretic derivation, in the case 
of external electromagnetic fields, are contained in J. Schwinger, Phys. Rev. 82, 
664 (1951)". What to me was an interesting irony emerged from learning of the 
connection between anomalies and the famous Schwinger paper |83| on vacuum 
polarization. I had in fact read Section II and the Appendices of the 1951 paper, 
when Alfred Goldhaber and I, during our senior year at Harvard (1960-61), did a 
reading course on quantum electrodynamics with Paul Martin, which focused on 
papers in Schwinger's reprint volume (Schwinger 84 ). Paul had told us to read 
the parts of the Schwinger paper that were needed to calculate the VV vacuum 
polarization loop, but to skip the rest as being too technical. Reading Section V 
of Schwinger's paper brought back to mind a brief, forgotten conversation I had 
had with Jack Steinberger, who was Director of the Varenna Summer School in 
1967. Steinberger had told me that he had done a calculation on the pseudovector- 
pscudoscalar equivalence theorem for 7r° — ► 77, but had gotten different answers in 
the two cases; also that Schwinger had claimed to reconcile the answers, but that 
he (Steinberger) couldn't make sense out of Schwinger's argument. Jack had urged 
me to look at it, which I never did until getting the Jackiw- Johnson preprint, but 
in retrospect everything fell into place, and the connection to Schwinger's work was 
apparent. 

This now brings me to the question, did Schwinger's paper constitute the discov- 
ery of the anomaly? Both Jackiw, in his paper with Johnson, and I were careful to 
note the connection between Schwinger's paper |83| and the point-splitting deriva- 
tions of the anomaly, once it was called to our attention. However, recently a number 
of Schwinger's former students have gone further, arguing that Schwinger was the 
discoverer of the anomaly and that my paper and that of Bell and Jackiw were 
merely a "rediscovery" of a previously known result. For example, Mehra and Mil- 
ton [2] refer (p. 488) to "...the axial-vector anomaly* which he (Schwinger) had 
discovered in [64] (their reference number for the 1951 paper)", and in the footnote 
labeled by * on the same page say "As we have noted, Stephen Adler, John S. Bell, 
and Roman Jackiw had rediscoverd the anomaly in 1968 " . In a similar vein, when 
I visited the University of Washington a few years ago I noticed a display case that 
had been set up in the theory wing, featuring the 1951 paper and arguing that 
Schwinger was the discoverer of the anomaly. 

With due respect to these opinions of colleagues, I disagree, for a number of rea- 
sons. The essence of the anomaly is that quantization necessarily violates classical 
symmetries, or to borrow a phrase from Bjorken's referee's report quoted above, not 
every "cherished symmetry of the theory" can be maintained. For example, in quan- 
tum theory the pseudovector coupling calculation of 7r° — > 27 decay, which by the 
Sutherland- Veltman theorem gives zero, and the pseudoscalar coupling calculation, 
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which gives nonzero, do not agree, even though classically there is a pseudoscalar- 
pseudovector equivalence theorem. This disagreement arises, as discussed in detail 
above, because the naive axial divergence equation used in proving the equivalence 
theorem is invalid in the quantized theory, as manifested in the anomaly. By con- 
trast, Schwinger's calculation was devoted to making the pseudovector calculation 
give the same non-zero answer as the pseudoscalar one. d He did this by redefin- 
ing the divergence of the axial- vector current as the limit of a point-split, gauge- 
covariant divergence (using his notation here, except that we continue to denote the 
electromagnetic vector potential by B^), 

^[tr 757 ^G(x,x)] = lim - ieB^x')) + + ieB fl (x"))]tr l5lfl G(x' , x") , 

(12) 

with G(x',x") the electron propagator. As noted in Appendix A, this equation is 
simply a rewriting of the pseudoscalar coupling expression —2miv^G(x l 1 x") by use 
of the equations of motion. Thus Schwinger's calculation is an indirect way of cal- 
culating the effective Lagrangian for pseudoscalar coupling, in the process of which 
he derives identities that can also be used to give a point-splitting calculation of 
the anomaly, as discussed, in the context of Zumino's calculation, in Appendix A. 
Although the expression of Eq. 112|) is only gauge covariant, it can be made gauge 
invariant by multiplying by an overall Wilson line factor exp[— ie f x „ dx x B\\ as in 
Eq. lHH , which since it is outside the derivatives reduces to unity in the limit as 
x" — > x'. With the overall Wilson line factor included, Schwinger's formulas are all 
gauge invariant, and one sees that what Schwinger calls the redefined axial- vector 
divergence is in fact, when expressed in the notation used in Section 12.11 not the 
divergence of the gauge-invariant axial vector current, but rather the axial- vector 
current divergence minus the anomaly. In other words, Schwinger's calculation effec- 
tively transposes the anomaly term to the left hand side of Eq. JJ), so that what he 
evaluates is the effective Lagrangian arising from the left hand side of the equation 

d^(x)-^F^(x)F^(x)e^ Tp =2im f(x) , (13) 

which then gives the same result as calculation of an effective Lagrangian from the 
right hand side, which is pseudoscalar coupling. 

The use of a point-splitting method was of course important and fruitful, and 
in retrospect, the axial anomaly is hidden within Schwinger's calculation. But 
Schwinger never took the crucial step of observing that the axial-vector current 
matrix elements cannot, in a renormalizable quantum theory, be made to satisfy all 
of the expected classical symmetries. And more specifically, he never took the step 
of defining a gauge-invariant axial- vector current by point splitting, as in Eq. fljl , 
which has a well-defined anomaly term in its divergence, as in Eq. with the 



d I wish to thank Peter van Nieuwcnhuizcn for emphasizing this point, as well as for stressing that 
Schwinger's calculation is not fully gauge invariant, and for urging me to make precise the relation 
between it and the later point-splitting calculations of the anomaly. 
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anomaly term completely accounting for the disagreement between the pseudoscalar 
and pseudovector calculations of neutral pion decay. So I would say that although 
Schwinger took steps in the right direction, particularly in noting the utility of 
point-splitting in defining the axial- vector current, his 1951 paper obscured the true 
physics and does not mark the discovery of the anomaly. This happened only much 
later, in 1968, and led to a flurry of activity by many people. My view is supported, 
I believe, by the fact that Schwinger's calculation seemed arcane, even to people 
(like Steinberger) with whom he had talked about it and to colleagues familiar with 
his work, and exerted no influence on the field until after preprints on the seminal 
work of 1968 had appeared. 

2.4. The Non-Abelian Anomaly, its Nonrenormalization and 
Geometric Interpretation 

Since in the chiral limit the AVV triangle is identical to an AAA triangle (by a simi- 
lar argument to that explained at the end of Section f2.2l involving anticommutation 
of a 75 around the loop) , I knew already in unpublished notes dating from the late 
summer of 1968 that the AAA triangle would also have an anomaly; a similar obser- 
vation was also made by Gerstein and Jackiw |46| . ^From fragmentary calculations 
begun in Aspen I suspected that higher loop diagrams might have anomalies as well, 
so after the nonrenormalization work with Bill Bardeen was finished I suggested to 
Bill that he work out the general anomaly for larger diagrams. (I was at that point 
involved in other calculations with Wu-ki Tung, on the perturbative breakdown of 
scaling formulas such as the Callan-Gross relation.) I showed Bill my notes, which 
turned out to be of little use, but which contained a very pertinent remark by Roger 
Dashen that including charge structure (which I had not) would allow a larger class 
of potentially anomalous diagrams. Within a few weeks, Bill carried out a brilliant 
calculation, by point-splitting methods, of the general anomaly in both the Abelian 
and the non- Abelian cases (Bardeen |lft|). 

The work discussed in Section 12.31 on the use of point-splitting in the Abelian 
case was known to Bardeen when he started his calculation, and influenced his 
choice of methodology. Rather than proceeding directly from Feynman diagrams 
as in my anomaly paper and that of Bell and Jackiw, he proceeded by combining 
a generalized version of point splitting with the formal S'-matrix expansion. (For 
a subsequent textbook derivation of the general anomaly directly from Feynman 
diagrams, see Chapt. 1 of van Nieuwenhuizen |98|.) Bardeen's ingenious adaptation 
of point-splitting involved using different infinitesimal separations for loops with 
different numbers of vertices. Thus, he started from the interaction Lagrangian 
C{z) — ip(z)T(z)ip(z), with r(z) involving general nonderivative couplings to exter- 
nal scalar, pseudoscalar, vector, and axial- vector fields, and replaced it with 

with n the number of vertices in the loop, and with an average over e to be performed 
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after evaluation of the loop. As explained by Bardeen, "The use of 1/n in the 
definition of the e separation is necessary so that loops with different numbers of 
vertices may be simply related, as needed for the Ward identities." 

Bardeen then defined vector and axial-vector currents by a variation of the S 
matrix with respect to the corresponding external fields, and calculated all of the 
Ward identities involving the smaller spinor loops. These contained singular terms 
and non-minimal "anomaly" terms that were artifacts of the calculational method, 
both of which could be removed by adding local polynomial counterterms to the 
single loop S matrix. What remained after this redefinition were the true anomaly 
terms. Again quoting from Bardeen, "If we required that the vector currents have 
the normal divergences, the divergences of the axial-vector currents contained the 
minimal anomalous terms given in Eq. (49). These anomalous terms were minimal 
in the sense that any further redefinition of the S matrix would cither destroy the 
normal vector-current divergences or simply give additional terms in the anomalous 
axial-vector-current divergences." Expressed in terms of vector and axial-vector 
Yang-Mills field strengths 

F^(x) = d»V u {x) - d v V^{x) - i[V»(x), V v {x)\ - i[A»(x), A v {x)\ , 
F^(x) = d"A u (x) - d v A»{x) - i[V»(x), A v (x)\ - i[A^(x),V u (x)} , (15) 

Bardeen's famous Eq. (49) for the non-Abelian axial- vector anomaly takes the form 

d l± J^(x) = normal divergence term 
+ (l/^)e^ T tr I X<X[(l/4)F^(x)F^(x) + {l/12)F?{x)FT(x) 
+ (2/3)iA f *{x)A u (x)F$ T {x) + {2/3)iF^(x)A a (x)A T (x) + {2/3)iA»{x)F^ {x)A T (x) 
-(8/3)A»(x)A u (x)A' T (x)A T (x)} , (16) 

with trj denoting a trace over internal degrees of freedom, and X\ the internal 
symmetry matrix associated with the axial-vector external field. In the Abelian 
case, with trivial internal symmetry structure, the terms involving two or three 
factors of A IJ,,V '"' vanish by antisymmetry of e^„ CTT , and there are only AW and 
AAA triangle anomalies. When there is non-trivial internal symmetry or charge 
structure, there are anomalies associated with the box and pentagon diagrams as 
well, confirming Dashen's intuition mentioned earlier. Bardeen notes that whereas 
the triangle and box anomalies result from linear divergences associated with these 
diagrams, the pentagon anomalies arise not from linear divergences, but rather from 
the definition of the box diagrams to have the correct vector current Ward identities. 
Bardeen also notes, in his conclusion, another prophetic remark of Dashen, to the 
effect that the pentagon anomalies should add anomalous terms to the PCAC low 
energy theorems for five pion scattering. 

There arc two distinct lines of argument leading to the conclusion that the 
non-Abelian chiral anomaly also has a nonrenormalization theorem, and is given 
exactly by Bardeen's leading order calculation. The first route parallels that used in 
the Abelian case, involving a loop-wise regulator construction in the proof initially 
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given by Bardeen followed by explicit fourth order calculation (Chanowitz 

|25|). an SU(3) analog of the Zee argument using the Callan-Symanzik equations 
(Pi and Shei [7^1 ), and use of dimensional regularization (Marinucci and Tonin 
[7T)j . Costa, Julve, Marinucci, and Tonin [3"Ij). The conclusion in all cases is that 
the Adler-Bardeen theorem extends to the non-Abelian case. Heuristically, what is 
happening is that except for a few small one-fermion loop diagrams, non-Abelian 
theories, just like Abelian ones, are made finite by gauge invariant regularization of 
the gluon propagators. But this regularization has no effect on the chiral properties 
of the theory, and therefore does not change its anomaly structure, which can thus 
be deduced from the structure of the few small fermion loop diagrams for which 
naive classical manipulations break down. For a later, more mathematically oriented 
proof, and further references, see Lucchesi, Piguet, and Sibold |67| . 

The second route leading to the conclusion that the non-Abelian anomaly is non- 
renormalized might be termed "algebraic/geometrical", and consists of two steps. 
The first step consists of a demonstration that the higher order terms in Bardeen's 
non-Abelian formula are completely determined by the leading, Abelian anomaly. 
During a summer visit to Fermilab in 1971, I collaborated with Ben Lee, Sam 
Treiman, and Tony Zee |12| in a calculation of a low energy theorem for the re- 
action 7 + 7^ 7r + 7r + 7rin both the neutral and charged pion cases. This was 
motivated in part by discrepancies in calculations that had just appeared in the 
literature, and in part by its relevance to theoretical unitarity calculations of a 
lower bound on the if£ — * /i + ' \i~ decay rate. Using PCAC, we showed that the fact 
that the 7 + 7 — * 37r matrix elements vanish in the limit when a final ir° becomes 
soft, together with photon gauge invariance, relates these amplitudes to the matrix 
elements F 7 ' for 7 + 7 — * tt° and F 3n for 7 — * 7r° + ir + + tt~ , and moreover, gives a 
relation between the latter two matrix elements, 



Thus all of the matrix elements in question are uniquely determined by ', which 
itself is determined by the AW anomaly calculation. An identical result for the 
same reactions was independently given by Terent'ev (Terentiev) 9Sj. In the mean- 
time, in a beautiful formal analysis, Wess and Zumino 103 showed that the current 
algebra satisfied by the flavor SU(3) octet of vector and axial- vector currents implies 
integrability or "consistency" conditions on the non-Abelian axial-vector anomaly, 
which are satisfied by the Bardeen formula, and conversely, that these constraints 
uniquely imply the Bardeen structure up to an overall factor, which is determined 
by the Abelian AW anomaly. By introducing an auxiliary pseudoscalar field, Wess 
and Zumino were able to write down a local action obeying the anomalous Ward 
identities and the consistency conditions. (There is no corresponding local action 
involving just the vector and axial- vector currents, since if there were, the anomalies 
could be eliminated by a local counterterm.) Wess and Zumino also gave expressions 
for the processes 7 — * 37r and 27 — > 37r discussed by Adler et al and Terentiev, as 



eF 3 * = f- 2 F 



f 



(17) 
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well as giving the anomaly contribution to the five pseudoscalar vertex. The net 
result of these three simultaneous pieces of work was to show that the Bardeen 
formula has a rigidly constrained structure, up to an overall factor given by the 
7T° — > 77 decay amplitude. 

The second step in the "algebraic/geometric" route to anomaly renormalization 
is a celebrated paper of Witten which shows that the Wess-Zumino action has 

a representation as the integral of a fifth rank antisymmetric tensor (constructed 
from the auxiliary pseudoscalar field) over a five dimensional disk of which four 
dimensional space is the boundary. In addition to giving a new interpretation of 
the Wess-Zumino action T, Witten's argument also gave a constraint on the overall 
factor in T that was not determined by the Wess-Zumino consistency argument. 
Witten observed that his construction is not unique, because a closed five-sphere 
intersecting a hyperplane gives two ways of bounding the four-sphere along the 
equator with a five dimensional hemispherical disk. Requiring these two construc- 
tions to give the same value for exp(ir), which is the way the anomaly enters into a 
Feynman path integral, requires integer quantization of the overall coefficient in the 
Wess-Zumino- Witten action. This integer can be read off from the AVV triangle 
diagram, and for the case of an underlying color SU (N c ) gauge theory turns out to 
be just N c , the number of colors. 

To summarize, the "algebraic/geometric" approach shows that the Bardeen 
anomaly has a unique structure, up to an overall constant, and moreover that this 
overall constant is constrained by an integer quantization condition. Hence once the 
overall constant is fixed by comparison with leading order perturbation theory (say 
in QCD), it is clear that this result must be exact to all orders, since the presence 
of renormalizations in higher orders of the strong coupling constant would lead to 
violations of the quantization condition. 

The fact that non-Abelian anomalies are given by an overall rigid structure 
has important implications for quantum field theory. For example, the presence of 
anomalies spoils the renormalizability of non-Abelian gauge theories and requires 
the cancellation of gauged anomalies between different fermion species, through 
imposition of the condition tr{T a , Tjg}T 7 = for all a, (3, 7, with T a the coupling 
matrices of gauge bosons to left-handed fermions. The fact that anomalies have a 
rigid structure then implies that once these anomaly cancellation conditions are 
imposed for the lowest order anomalous triangle diagrams, no further conditions 
arise from anomalous square or pentagon diagrams, or from radiative corrections 
to these leading fermion loop diagrams. A second place where the one-loop geo- 
metric structure of non-Abelian anomalies enters is in instanton physics, where the 
structural match between the instanton action 8ir 2 / g 2 and the anomaly coefficient 
g 2 /(16-7T 2 ) results in the integral of the anomaly over an instanton being an inte- 
ger. This integer can be interpreted as a topological winding number, and through 



e See Gross and Jackiw 51 , Bouchiat, Iliopoulos and Meyer 1231 . and Weinberg 1101 1 . 
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the anomaly leads to a relation between chiral transformations of the fcrmion fields 
ipf — > exp(z75<a;/)^'/, and shifts in the theta angle that describes the gauge field vac- 
uum, 9^9 + 2 a f . The fact that the non-Abelian anomaly is nonrenormalized 
implies that this relation is exact, and has no higher order perturbative corrections/ 
Yet another place where anomaly nonrenormalization plays a role is in the 't Hooft 
anomaly matching conditions (for a survey and references, see Weinberg |1()2| . Sec. 
22.5). 

3. Trace Anomalies and their all Orders Calculation 

In an influential paper Wilson |l()5j proposed the operator product expansion, in- 
corporating ideas on the approximate scale invariance of the strong interactions 
suggested by Mack [HE]- As one of the applications of his technique, Wilson dis- 
cussed 7T° — * 27 decay and the axial- vector anomaly from the viewpoint of the short 
distance singularity of the coordinate space AW three-point function. Using these 
methods, Crewther and Chanowitz and Ellis [23 investigated the short distance 
structure of the three-point function 9V^V V , with 9 = 9^ the trace of the energy- 
momentum tensor, and computed the explicit form of its leading order anomaly. 8 
Their calculations confirmed earlier indications of a perturbative trace anomaly 
obtained in a study of broken scale invariance by Coleman and Jackiw . 

Letting A M „(p) be the momentum space 9V p V l/ three-point function, and n M „ 
be the corresponding V^V^ two-point function, the naive Ward identity A^,,^) = 
(2 — p a d/dp cr )U l j, u (p) is modified to 

(d \ R 
2-p a — JU tlu (p)-— (p„p„ - 7]„vP 2 ) , (18) 

with the trace anomaly coefficient R given by 

R = E + \ E ■ (w) 

i,spini i,spinO 

Thus, for QED, with a single fermion of charge e, the anomaly term is 
— [2a/(3n)](p fl p L/ — -q^p 2 ). In a subsequent paper, Chanowitz and Ellis [2U showed 
that the fourth order trace anomaly can be read off directly from the coefficient of 
the leading logarithm in the asymptotic behavior of ILj„(p), giving to next order 
an anomaly coefficient — 2a/(37r) — a 2 /(27r 2 ). Thus, their fourth order argument 
indicated a direct connection between the trace anomaly and the renormalization 
group [3 function. 



f I have followed here the notation used in the text of Weinberg 11021 . p. 457. I am indebted to 
Hitoshi Murayama for this remark on instanton physics. 

s In Crewther's Eqs. (14) and (15) the trace 9^ is the "naive" trace excluding the electromagnetic 
anomaly contribution, that is, it only contains hadronic fields. I wish to thank Rod Crewther for 
email correspondence on this point. (My comment here is analogous to the comment above that 

8A in Zee's paper rrnsi is the « naive" divergence.) 
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Although not the historical route, the fact that there is a trace anomaly can 
also be inferred by inspection of the Pauli-Villars regulator construction used to 
define gauge invariant fermion loop diagrams. As pointed out by Zumino jlllj and 
by Jackiw |55j . the axial- vector anomaly can be given a simple interpretation this 
way. Introducing a Pauli-Villars fermion of mass Mo, the vacuum to two photon 
matrix element of the regularized axial-vector divergence equation takes the form 

(0|d^| 7 7) mo ~ (0|d^|77) Mo = 2zm (0|j 5 |77> m ° - 2zM (0|j 5 |77> M ° • (20) 

The left hand side of this equation approaches the regularized axial-vector diver- 
gence as Mo — > oo. The first term on the right hand side is well defined by itself and 
gives the matrix element of the naive axial- vector divergence. However, gauge invari- 
ance combined with dimensional analysis (or direct calculation) shows that for large 
M , we have (0|j 5 | 7 7) M ° oc Af _1 (0\F^ a F rp e icrTp \^) M ° , and so the second term on 
the right approaches a nonzero limit as Mq — > oo, and gives the anomaly contri- 
bution. Let us now apply a similar analysis to the vacuum to two photon matrix 
element of the energy-momentum tensor, defined in a similar way by Pauli-Villars 
regularization, 

(0|^| 7 7) mo - (0|^|77) M) = mo<0|j|ry) mo - M (0|j| 7 7) Wd . (21) 

with j = ipip the scalar current. With the imposition of gauge invariance (to elimi- 
nate quadratic divergences), the left hand side of this equation approaches the regu- 
larized trace of the energy-momentum tensor, and the first term on the right is well 
defined and gives the matrix element of the naive trace. However, for the scalar cur- 
rent, gauge invariance combined with dimensional analysis shows that for large Mo, 
we have (0|j|77) M ° oc M ~ 1 (0|-F po 'i 7 'p Cr |77) Mo , and so the second term on the right 
approaches a nonzero limit as M — > oo, and gives the trace anomaly. Pursuing the 
details by computing the proportionality constants, one sees that Eqs. H2()I21|I give 
the correct anomaly coefficients in the two cases. This way of deriving the trace 
anomaly shows clearly that it is a scalar analog of the pseudoscalar axial-vector 
anomaly, a fact that will figure in the discussion of anomalies in supersymmetric 
theories. 

My involvement with trace anomalies began roughly five years later, when Phys- 
ical Review sent me for refereeing a paper by Iwasaki [54| . In this paper, which noted 
the relevance to trace anomalies, Iwasaki proved a kinematic theorem on the vacuum 
to two photon matrix element of the trace of the energy-momentum tensor, that is 
an analog of the Sutherland- Veltman theorem for the vacuum to two photon matrix 
element of the divergence of the axial- vector current. Just as the latter has a kine- 
matic zero at q 2 = 0, Iwasaki showed that the kinematic structure of the vacuum to 
two photon matrix clement of the energy-momentum tensor implies, when one takes 
the trace, that there is also a kinematic zero at q 2 = 0, irrespective of the presence 
of anomalies (just as the Sutherland- Veltman result holds in the presence of anoma- 
lies). Reading this article suggested the idea that just as the Sutherland- Veltman 
theorem can be used as part of an argument to prove nonrenormalization of the 
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axial-vector anomaly, Iwasaki's theorem could be used to analogously calculate the 
trace anomaly to all orders. 11 During the spring of 1976 I wrote an initial preprint 
attempting an all orders calculation of the trace anomaly in quantum electrody- 
namics, but this had an error pointed out to me by Baqi Beg. Over the summer of 
1976 I then collaborated with two local postdocs, John Collins (at Princeton) and 
Anthony Duncan (at the Institute), to work out a corrected version (Adler, Collins, 
and Duncan jll|). Collins and Duncan simultaneously teamed up with another Insti- 
tute postdoc, Satish Joglekar, to apply similar ideas to quantum chromodynamics, 
published as Collins, Duncan, and Joglekar [SB], and simultaneously the same result 
for QCD was obtained by N. K. Nielsen [73| . 

In the simpler case of QED, a sketch of the argument based on Iwasaki's theorem 
goes as follows. Let us write the trace of the energy-momentum tensor as 

9£ = Kxma^l) + K 2 N[F X<J F^} + (22) 

with ... denoting terms that vanish by the equations of motion, and with N[F\ a F Xa ] 
specified by the conditions that its zero momentum transfer matrix elements be- 
tween two electron states, and between the vacuum and a two photon state, are 
given by the corresponding tree approximation matrix elements of the operator 
Z^ 1 F\ a F Xa . Taking matrix elements of 6£ between electron states at zero momen- 
tum transfer then gives the condition 

Ki(e(p)\m W\e(p)) = m . (23) 

However, an analysis using the Callan-Symanzik equation for the electron propaga- 
tor shows that 

(e(p)\m ^\e(p)) = , (24) 

1 + 6(a) 

with I + 6(a) = (m/mo)dmo/dm, so that K\ is given by K\ = l + 5(a). Now taking 
the vacuum to two photon matrix element of Ofi at zero momentum transfer, and 
using Iwasaki's theorem and the defining condition for N[F\ (T F > ' a '], we get a second 
condition 

= [l + 6(a)](0\m ^\ 1 (p) 1 (-p)) 
+ K 2 {G\Z^F x<r F x "\ 1 (jp) 1 (-p)) tTee . (25) 

However, analysis of the Callan-Symanzik equation for the photon propagator shows 
that 

(0|m ^|7(p)7(-p)) = ~\ i^s(a) 
x(0|Z 3 - 1 F Aff F ACT |7b)7(-p))trcc , (26) 



h In addition to writing a favorable report on Iwasaki's paper, I invited him to spend a year at the 
Institute for Advanced Study, which he did during the 1977-78 academic year. 
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with /3(a) defined by /3(a) = (m/a)da/dm. Hence Ki is given by if 2 = j(3(a), and 
so the final result for the trace equation is 

0£ = [l + S(a)}m ^ + lp(a)N[F Xa F^} + ... . (27) 

The first two terms in the power series expansion of the coefficient of the F\ a F XcT 
term in the trace agree with the fourth-order calculation of Chanowitz and Ellis. The 
trace equation in QCD has a similar structure, again with the /3 function appearing 
as the anomaly coefficient. 

4. Epilogue: Chiral and Trace Anomalies in Supersymmetric 
Yang-Mills Theories 

In Section[2]we saw that the chiral anomaly is given exactly by the one loop calcula- 
tion, while in Section [31 we saw that the trace anomaly can be calculated exactly in 
terms of the renormalization group /3-function. These two results have figured in a 
large literature dealing with anomalies in supersymmetric theories, and in particu- 
lar in supersymmetric Yang- Mills theory, which is the case we shall focus on here (a 
similar analysis can be given when supersymmetric matter fields are included). We 
shall not attempt to cite all of the relevant papers, but note that reviews as of the 
mid-1980s were given in the comprehensive paper of Shifman and Vainshtein 89 , 
and also in the introductory section of Ensign and Mahanthappa [21] • Important 
recent discussions include Shifman's Sakurai Prize lecture jHH] and the analysis of 
Arkani-Hamed and Murayama 14 . Although historically the topic of supersym- 
metric anomalies initially generated much confusion, as I understand it the current 
status of the subject can be summarized as follows. 

(1) In supersymmetric Yang-Mills theory at the classical level, there is a supercur- 
rent containing among its components both the axial-vector current and the 
energy momentum tensor (Ferrara and Zumino |4(Jj: Piguet and Sibold [77] ). 
Thus, at the quantum level, where anomalies appear, one might expect there to 
be a relation between the axial- vector anomaly and the trace anomaly (Grisaru 
|4*7| : Piguet and Sibold HE]). In fact, since we have seen that the scalar trace 
anomaly is an analog of the pseudoscalar chiral anomaly, and since chiral super- 
multiplets in supersymmetry contain scalar and pseudoscalar fields on an equal 
footing, we should expect there to be an anomaly chiral supermultiplet. 1 How- 
ever, even in the supersymmetric case there is an axial-vector current that obeys 
the Adler-Bardeen theorem (Clark, Piguet, and Sibold [21] ; Jones and Leveille 
Piguet and Sibold [70]; Ensign and Mahanthappa [33]). whereas a number 

'All components of the supercurrent have anomalies. For example, the one-loop anomaly in the 
spinor supersymmetry current was obtained by Curtright and the one-loop supercurrent 

anomalies for the Wess-Zumino model were obtained by Clark, Piguet, and Sibold 1281 . For a 
summary of the one-loop supercurrent anomalies in component field form, and further references, 
see Appendices B and C of Adler [7| and Chapter 20 of West 11041 . 
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of calculations have shown that there are higher loop contributions to the trace 
anomaly (Jones [HOI; Novikov, Shifman, Vainshtein, and Zakharov Grisaru 
and West [SOj)- These authors gave differing arguments (with differing degrees 
of validity - see Novikov, Shifman, Vainshtein, and Zakharov |7S|) for what is 
now called the NSVZ /3 function, given by Eq. {28) below. Thus, there seems to 
be a potential conflict between supersymmetry and the all orders calculations 
of anomalies. 

(2) The resolution to the potential conflict consists in noting that there is more 
than one consistent choice of subtraction scheme for the renormalized currents 
and their anomalies (Jones, Mezincescu, and West [H2]; Grisaru, Milewski, and 
Zanon ^HlEHj)- Thus, there are two manifestly supersymmetric supercurrents, 
which coincide at the classical level, one of which satisfies the Adler-Bardeen 
theorem, with a one-loop anomaly, and the other which has an anomaly given by 
the multiloop (3- function. The paper of Jones, Mezincescu, and West shows that 
the relation between the two supercurrents involves two functions of coupling 
which they call a (g) and b(g); when one assumes that these two functions are 
equal one can derive a formula for the (3 function to all orders, 



with /jW the one-loop (3 function. The result of Eq. Ij28(l is what was found in 
the instanton calculation of Novikov, Shifman, Vainshtein, and Zakharov [71] 
and also in an earlier calculation of Jones jHOl ! however, this earlier Jones result 
cannot be regarded as an alternative derivation of the NSVZ (3 function, since 
it has implicitly assumed a(g) = b{g). 
(3) We note that if it were not for the denominator 1 + |/3W(g)/g, Eq. H28JI would 
be saturated by the one- loop result. Thus Eq. H28[) can be regarded as a renor- 
malization of a one- loop result, and as such, it is simpler in structure than 
the (3 functions found in the non-supersymmetric case, where the chiral and 
trace anomalies are unrelated (witness the factor of 1/3 in the trace anomaly 
coefficient) even in leading one-loop order. This is the basis for the modern 
interpretation of the supersymmetric "anomaly puzzle" . According to this in- 
terpretation, first proposed by Shifman and Vainshtein |89j . the supercurrent 
generalizing the Adler-Bardeen theorem has an anomaly given by the one loop 
term in Eq. (|28l) , whereas the supercurrent generalizing the trace anomaly dis- 
cussed in Section [31 has the (3 function of Eq. I|28|l . including the renormalizing 
denominator. These two differing supercurrents in fact correspond to two dif- 
ferent choices of coupling constant appropriate to two different calculational 
schemes, with the renormalizing denominator arising from the transformation 
between the two calculational schemes. If one uses a Wilsonian effective action, 
in which only virtual momenta greater than some (large) minimum fj, are kept, 



Pig) 



(3^{g) 



l + |/? (1) (.9)/3 ' 
-3C 2 (G).g 3 /(16^ 2 ) 



(28) 
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then the supercurrent anomaly is exhausted at one loop order. On the other 
hand, if one uses a one particle irreducible effective action with a canonically 
normalized coupling constant, the associated coupling renormalization leads to 
a supercurrent anomaly given by the NSVZ j3 function of Eq. Ij28(l . 
(4) Arkani-Hamed and Murayama |14j have given a very interesting interpretation 
of the two possible supercurrents and anomalies in terms of how one introduces 
the bare coupling in the Lagrangian, working entirely within a Wilsonian action 
framework, without reference to the one particle irreducible effective action 
invoked by Shifman and Vainshtein. (For a related commentary, see Shifman 
|88|.) If one uses a manifestly holomorphic coupling definition 

C = -kl d 2 e\[W a {V)] 2 + adjoint , (29) 

with 1/g 2 , = 1 / g 2 + i9 / (8tt 2 ) , and V — the vector superfield, holomorphicity 
arguments of the type used to prove supersymmetry nonrenormalization theo- 
rems show that the j3 function is exhausted at one-loop order, in agreement with 
the Shifman- Vainshtein construction based on the Wilsonian effective action. 
This corresponds to the definition of the supercurrent multiplet that contains 
an axial-vector current obeying the Adler-Bardeen theorem. On the other hand, 
one can instead use a "canonical" coupling, 

defined as one that gives the kinetic energy a coupling-independent coefficient. 
This is not holomorphic in g c because of the self-adjoint restriction on the 
superfield argument g c V of the field-strength superfield W a . Since transform- 
ing from the holomorphic coupling to the canonical coupling involves a field 
rescaling, there is a Jacobian associated with this transformation, that can be 
computed in analogy with the non-supersymmetric Fujikawa @3] trace anomaly 
calculation and the later supersymmetric Konishi-Shizuya |65j anomaly calcula- 
tion. Arkani-Hamed and Murayama show that this Jacobian precisely accounts 
for the difference between the one loop j3 function and the NSVZ (3 function of 
Eq. (|28ll , corresponding to the transformation given by Shifman and Vainshtein, 



„2 = Re 72 ) lo &9c , (31) 



from the Wilsonian to the one particle irreducible effective action. A standard 
result of the renormalization group states that under analytic transformations 
of the coupling constant, the one- and two-loop /3 functions are invariant, with 
changes appearing only at three- and higher loop order. However, because the 
transformation of Eq. (|3 1|) has a non-analytic, logarithmic dependence on g c , 
it can transform the one-loop beta function /JW into the NSVZ (3 function, 
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which has a nonvanishing two-loop term: letting A be the cutoff, differentiating 
Eq. J3U implies that 



which with 0(g h ) = -3C 2 (G)gl/ (16tt 2 ) implies Eq. (gSJ for (3(g c ). Although we 
have seen that without further input, functional integral Jacobian calculations 
may give only leading order results, Arkani-Hamed and Murayama give a sec- 
ond method of indirectly determining the Jacobian by using the finiteness of 
N = 2 supersymmetric theories, which shows that the one-loop answer for the 
Jacobian is exact, a conclusion also reached in the earlier paper of Shifman and 
Vainshtein. This thus closes the gap of showing that a(g) — b(g) left unresolved 
in the work of Jones, Mezincescu, and West, and gives an alternative all orders 
derivation of the NSVZ result, and explains in a simple way why the NSVZ 
formula has the structure of a one-loop (3 function up to a renormalizing factor. 

To sum up, there now appears to be a detailed understanding of the role played 
by all orders calculations of chiral and trace anomalies in supersymmetric gauge the- 
ories. In retrospect, the "supersymmetric anomaly puzzle" stemmed from the initial, 
preliminary generalization from known non-supersymmetric results to the super- 
symmetric case, which suggested that one should find a unique supercurrent with a 
one-loop anomaly structure. What one finds, instead, is an equivalence class of su- 
percurrents, differing by the coupling constant definition, with a one-loop anomaly 
structure modulo renormalizations arising from redefinitions of the coupling con- 
stant - different from first expectations, but not that far off. This contrasts with 
the all orders anomalies found in QED and QCD, where the trace anomaly is given 
by a (3 function that is unrelated to the chiral anomaly even at one-loop order, 
reflecting the fact that these theories are not supersymmetric. 

To conclude, I note that I have only touched in this essay on one part of a 
large subject. For further, detailed discussions of anomalies, see the books of van 
Nieuwenhuizen [SB] and Bertlmann 120], as well as chapters or sections of the books 
by Weinberg |102j . Makeenko [HU], Volovik |100| . and Zee |109| . and of a forthcoming 
book by van Nieuwenhuizen.- 1 
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Appendix A. Some Details of the Zumino and Schwinger 
Calculations 

I give here a few key formulas from the Zumino |ll()j point-splitting calculation, 
that allow one to understand what is being done in the Schwinger |SSj calculation. 
I follow the notation used by both Zumino and Schwinger, and so in this Appendix 
denotes the electromagnetic vector potential, that was denoted by in the 
text, and the metric conventions also differ from those used in the text. Zumino's 
treatment starts from the formula 

[0 M - ieA„{x) + + ieA^x')]tr l5 ^G{x, x') = -2mtr 75 G(x, x') , (A.l) 

which is immediately obtained from the electron Dirac equation in the presence 
of an electromagnetic field, and the definition of the electron Green's function G. 
Zumino defines a gauge invariant axial-vector current by the x — ► x' limit of the 
point-split expression 

tr757 AI G(x, x 1 ) exp[— ie f A-dx] . (A.2) 

J x' 

(Note that this is opposite in sign to the axial-vector current used above in the 
text, because both Zumino and Schwinger define the axial- vector current with the 
ordering 757^, whereas in the text we use the ordering 7^75.) To calculate the 
divergence of this formula, he uses the easily derived identity 

[dp - ieA^(x) + d^ + ieA M (x')] exp[ie /* A ■ dx] 
= <^[iefi l A'dx][d li + d , li +ie(x-x , ) u F lw ] , (A.3) 

with = d^A v — d v Afj, the field strength evaluated at the midpoint of the Wilson 
line. Rewriting this formula as 

exp[-ie f* A ■ dx][<9 M - ieA^x) +d^+ ieA^x')] 

= + d 'n + ie ( x ~ x'YFuv] exp[-ie /* A-dx] , (A.4) 

one finds by use of Eqs. (|A.1(I and (|A. 21) that 

[fy + d'^+ ie(x - x'YF^]tv^YG{x, x') exp[-ze f* A ■ dx] 

= — 2mti^ 5 G(x, x') exp[— ie f x , A-dx] . (A. 5) 

The right hand side of Eq. (|A. 5|) is the point-split definition of the naive divergence. 
The left hand side is a sum of the point-split definition of the ordinary divergence 
of the gauge- invariant point-split axial- vector current, and an extra term that gives 
the anomaly. The anomaly term is readily evaluated by using the formula given by 
Schwinger and Zumino for the small x — x' behavior of the Green's function, 

ie - (x- x') x f x 

tr757 M G(x,x') ~ — ^exp[ie / A-dx] , (A.6) 

2tt z (x — x'Y J x , 

with F^ = \e ll \ pa F f " J the dual electromagnetic field strength. 



:-!() 



S. Adler 



Schwinger's calculation consists of an evaluation of the left hand side of Eq. (|A.5|) 
by first using Eq. (jA. Bf) (which is derived in Schwinger's Eqs. (5.17)-(5.21)), and 
then using Eq. (|A.3|I (which is applied in Schwinger's Eq. (5.22)). As noted in the 
text, Schwinger's identification of a gauge-covariant axial-vector divergence is in- 
correct - apart from the overall minus sign noted above, the object that he so labels 
is actually (up to a Wilson line factor) the gauge-invariant axial-vector divergence, 
minus the axial-vector anomaly term. There is no gauge-invariant axial-vector cur- 
rent for which this combination is the divergence, but as shown in Eqs. (58) and (59) 
of Adler 0, one can readily write down a gauge-non-invariant axial- vector current 
which has this divergence. 
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